Overview

Dataset statistics

Number of variables15
Number of observations99003
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.3 MiB
Average record size in memory120.0 B

Variable types

Numeric14
Categorical1

Warnings

Age is highly correlated with Dob_yearHigh correlation
Dob_year is highly correlated with AgeHigh correlation
Friend_count is highly correlated with Friendships_initiatedHigh correlation
Friendships_initiated is highly correlated with Friend_countHigh correlation
Likes is highly correlated with Mobile_likes and 1 other fieldsHigh correlation
Likes_received is highly correlated with Mobile_likes_received and 1 other fieldsHigh correlation
Mobile_likes is highly correlated with LikesHigh correlation
Mobile_likes_received is highly correlated with Likes_received and 1 other fieldsHigh correlation
Www_likes is highly correlated with LikesHigh correlation
Www_likes_received is highly correlated with Likes_received and 1 other fieldsHigh correlation
Age is highly correlated with Dob_yearHigh correlation
Dob_year is highly correlated with AgeHigh correlation
Friend_count is highly correlated with Friendships_initiated and 3 other fieldsHigh correlation
Friendships_initiated is highly correlated with Friend_count and 2 other fieldsHigh correlation
Likes is highly correlated with Likes_received and 4 other fieldsHigh correlation
Likes_received is highly correlated with Friend_count and 5 other fieldsHigh correlation
Mobile_likes is highly correlated with Likes and 3 other fieldsHigh correlation
Mobile_likes_received is highly correlated with Friend_count and 5 other fieldsHigh correlation
Www_likes is highly correlated with Likes and 1 other fieldsHigh correlation
Www_likes_received is highly correlated with Friend_count and 5 other fieldsHigh correlation
Age is highly correlated with Dob_yearHigh correlation
Dob_year is highly correlated with AgeHigh correlation
Friend_count is highly correlated with Friendships_initiatedHigh correlation
Friendships_initiated is highly correlated with Friend_countHigh correlation
Likes is highly correlated with Likes_received and 3 other fieldsHigh correlation
Likes_received is highly correlated with Likes and 3 other fieldsHigh correlation
Mobile_likes is highly correlated with Likes and 2 other fieldsHigh correlation
Mobile_likes_received is highly correlated with Likes and 3 other fieldsHigh correlation
Www_likes_received is highly correlated with Likes and 2 other fieldsHigh correlation
Age is highly correlated with Dob_yearHigh correlation
Friend_count is highly correlated with Friendships_initiatedHigh correlation
Mobile_likes is highly correlated with LikesHigh correlation
Likes_received is highly correlated with Www_likes_received and 1 other fieldsHigh correlation
Www_likes_received is highly correlated with Likes_received and 1 other fieldsHigh correlation
Friendships_initiated is highly correlated with Friend_countHigh correlation
Mobile_likes_received is highly correlated with Likes_received and 1 other fieldsHigh correlation
Www_likes is highly correlated with LikesHigh correlation
Dob_year is highly correlated with AgeHigh correlation
Likes is highly correlated with Mobile_likes and 1 other fieldsHigh correlation
Likes_received is highly skewed (γ1 = 112.0745682) Skewed
Mobile_likes_received is highly skewed (γ1 = 107.5312999) Skewed
Www_likes_received is highly skewed (γ1 = 126.257317) Skewed
Userid has unique values Unique
Friend_count has 1962 (2.0%) zeros Zeros
Friendships_initiated has 2997 (3.0%) zeros Zeros
Likes has 22308 (22.5%) zeros Zeros
Likes_received has 24428 (24.7%) zeros Zeros
Mobile_likes has 35056 (35.4%) zeros Zeros
Mobile_likes_received has 30003 (30.3%) zeros Zeros
Www_likes has 60999 (61.6%) zeros Zeros
Www_likes_received has 36864 (37.2%) zeros Zeros

Reproduction

Analysis started2021-06-06 12:18:06.791468
Analysis finished2021-06-06 12:19:42.452636
Duration1 minute and 35.66 seconds
Software versionpandas-profiling v3.0.0
Download configurationconfig.json

Variables

Userid
Real number (ℝ≥0)

UNIQUE

Distinct99003
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1597045.208
Minimum1000008
Maximum2193542
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:42.725785image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1000008
5-th percentile1060618.3
Q11298805.5
median1596148
Q31895744
95-th percentile2133357.1
Maximum2193542
Range1193534
Interquartile range (IQR)596938.5

Descriptive statistics

Standard deviation344059.1775
Coefficient of variation (CV)0.2154348391
Kurtosis-1.199556831
Mean1597045.208
Median Absolute Deviation (MAD)298438
Skewness0.0001076605667
Sum1.581122667 × 1011
Variance1.183767176 × 1011
MonotonicityNot monotonic
2021-06-06T17:49:43.073105image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10485761
 
< 0.1%
10252251
 
< 0.1%
15730731
 
< 0.1%
14987371
 
< 0.1%
18509891
 
< 0.1%
14419351
 
< 0.1%
14273551
 
< 0.1%
14680061
 
< 0.1%
16052211
 
< 0.1%
21170851
 
< 0.1%
Other values (98993)98993
> 99.9%
ValueCountFrequency (%)
10000081
< 0.1%
10000131
< 0.1%
10000151
< 0.1%
10000381
< 0.1%
10000591
< 0.1%
10000611
< 0.1%
10000681
< 0.1%
10000941
< 0.1%
10001031
< 0.1%
10001251
< 0.1%
ValueCountFrequency (%)
21935421
< 0.1%
21935381
< 0.1%
21935221
< 0.1%
21934991
< 0.1%
21934851
< 0.1%
21934731
< 0.1%
21934681
< 0.1%
21934651
< 0.1%
21934601
< 0.1%
21934181
< 0.1%

Age
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct101
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.28022383
Minimum13
Maximum113
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:43.384140image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum13
5-th percentile15
Q120
median28
Q350
95-th percentile90
Maximum113
Range100
Interquartile range (IQR)30

Descriptive statistics

Standard deviation22.58974831
Coefficient of variation (CV)0.6059445462
Kurtosis1.561446767
Mean37.28022383
Median Absolute Deviation (MAD)10
Skewness1.415260654
Sum3690854
Variance510.2967289
MonotonicityNot monotonic
2021-06-06T17:49:43.668827image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
185196
 
5.2%
234404
 
4.4%
194391
 
4.4%
203769
 
3.8%
213671
 
3.7%
253641
 
3.7%
173283
 
3.3%
163086
 
3.1%
223032
 
3.1%
242827
 
2.9%
Other values (91)61703
62.3%
ValueCountFrequency (%)
13484
 
0.5%
141925
 
1.9%
152618
2.6%
163086
3.1%
173283
3.3%
185196
5.2%
194391
4.4%
203769
3.8%
213671
3.7%
223032
3.1%
ValueCountFrequency (%)
113202
 
0.2%
11218
 
< 0.1%
11118
 
< 0.1%
11015
 
< 0.1%
1099
 
< 0.1%
1081661
1.7%
10798
 
0.1%
106125
 
0.1%
10580
 
0.1%
10473
 
0.1%

Dob_day
Real number (ℝ≥0)

Distinct31
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.53040817
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:43.947650image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q17
median14
Q322
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)15

Descriptive statistics

Standard deviation9.015606359
Coefficient of variation (CV)0.6204647697
Kurtosis-1.188960111
Mean14.53040817
Median Absolute Deviation (MAD)8
Skewness0.1078407568
Sum1438554
Variance81.28115802
MonotonicityNot monotonic
2021-06-06T17:49:44.168824image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
17900
 
8.0%
104030
 
4.1%
153555
 
3.6%
53545
 
3.6%
123413
 
3.4%
23409
 
3.4%
33291
 
3.3%
173266
 
3.3%
203263
 
3.3%
143219
 
3.3%
Other values (21)60112
60.7%
ValueCountFrequency (%)
17900
8.0%
23409
3.4%
33291
3.3%
43217
3.2%
53545
3.6%
63108
 
3.1%
73010
 
3.0%
83202
3.2%
93003
 
3.0%
104030
4.1%
ValueCountFrequency (%)
311507
1.5%
302530
2.6%
292508
2.5%
282955
3.0%
272755
2.8%
262753
2.8%
253217
3.2%
242807
2.8%
232864
2.9%
222838
2.9%

Dob_year
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct101
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1975.719776
Minimum1900
Maximum2000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:44.448739image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1900
5-th percentile1923
Q11963
median1985
Q31993
95-th percentile1998
Maximum2000
Range100
Interquartile range (IQR)30

Descriptive statistics

Standard deviation22.58974831
Coefficient of variation (CV)0.01143368032
Kurtosis1.561446767
Mean1975.719776
Median Absolute Deviation (MAD)10
Skewness-1.415260654
Sum195602185
Variance510.2967289
MonotonicityNot monotonic
2021-06-06T17:49:44.749042image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19955196
 
5.2%
19904404
 
4.4%
19944391
 
4.4%
19933769
 
3.8%
19923671
 
3.7%
19883641
 
3.7%
19963283
 
3.3%
19973086
 
3.1%
19913032
 
3.1%
19892827
 
2.9%
Other values (91)61703
62.3%
ValueCountFrequency (%)
1900202
 
0.2%
190118
 
< 0.1%
190218
 
< 0.1%
190315
 
< 0.1%
19049
 
< 0.1%
19051661
1.7%
190698
 
0.1%
1907125
 
0.1%
190880
 
0.1%
190973
 
0.1%
ValueCountFrequency (%)
2000484
 
0.5%
19991925
 
1.9%
19982618
2.6%
19973086
3.1%
19963283
3.3%
19955196
5.2%
19944391
4.4%
19933769
3.8%
19923671
3.7%
19913032
3.1%

Dob_month
Real number (ℝ≥0)

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.283365151
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:44.981991image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.529671569
Coefficient of variation (CV)0.5617485987
Kurtosis-1.240397572
Mean6.283365151
Median Absolute Deviation (MAD)3
Skewness0.03129550742
Sum622072
Variance12.45858138
MonotonicityNot monotonic
2021-06-06T17:49:45.188586image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
111772
11.9%
108476
8.6%
58271
8.4%
88266
8.3%
38110
8.2%
78021
8.1%
97939
8.0%
127894
8.0%
47810
7.9%
27632
7.7%
Other values (2)14812
15.0%
ValueCountFrequency (%)
111772
11.9%
27632
7.7%
38110
8.2%
47810
7.9%
58271
8.4%
67607
7.7%
78021
8.1%
88266
8.3%
97939
8.0%
108476
8.6%
ValueCountFrequency (%)
127894
8.0%
117205
7.3%
108476
8.6%
97939
8.0%
88266
8.3%
78021
8.1%
67607
7.7%
58271
8.4%
47810
7.9%
38110
8.2%

Gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size773.6 KiB
male
58749 
female
40254 

Length

Max length6
Median length4
Mean length4.813187479
Min length4

Characters and Unicode

Total characters476520
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowmale
2nd rowfemale
3rd rowmale
4th rowfemale
5th rowmale

Common Values

ValueCountFrequency (%)
male58749
59.3%
female40254
40.7%

Length

2021-06-06T17:49:45.783567image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2021-06-06T17:49:45.985173image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
ValueCountFrequency (%)
male58749
59.3%
female40254
40.7%

Most occurring characters

ValueCountFrequency (%)
e139257
29.2%
m99003
20.8%
a99003
20.8%
l99003
20.8%
f40254
 
8.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter476520
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e139257
29.2%
m99003
20.8%
a99003
20.8%
l99003
20.8%
f40254
 
8.4%

Most occurring scripts

ValueCountFrequency (%)
Latin476520
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e139257
29.2%
m99003
20.8%
a99003
20.8%
l99003
20.8%
f40254
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII476520
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e139257
29.2%
m99003
20.8%
a99003
20.8%
l99003
20.8%
f40254
 
8.4%

Tenure
Real number (ℝ≥0)

Distinct2426
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean537.8848318
Minimum0
Maximum3139
Zeros70
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:46.160058image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile47
Q1226
median412
Q3675
95-th percentile1575
Maximum3139
Range3139
Interquartile range (IQR)449

Descriptive statistics

Standard deviation457.645601
Coefficient of variation (CV)0.8508245147
Kurtosis2.199181661
Mean537.8848318
Median Absolute Deviation (MAD)213
Skewness1.535709166
Sum53252212
Variance209439.4961
MonotonicityNot monotonic
2021-06-06T17:49:46.408784image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300173
 
0.2%
303170
 
0.2%
242164
 
0.2%
272163
 
0.2%
257161
 
0.2%
297161
 
0.2%
280160
 
0.2%
285160
 
0.2%
278158
 
0.2%
284158
 
0.2%
Other values (2416)97375
98.4%
ValueCountFrequency (%)
070
0.1%
160
0.1%
272
0.1%
379
0.1%
486
0.1%
592
0.1%
693
0.1%
784
0.1%
887
0.1%
993
0.1%
ValueCountFrequency (%)
31393
< 0.1%
31291
 
< 0.1%
31281
 
< 0.1%
31011
 
< 0.1%
30191
 
< 0.1%
29581
 
< 0.1%
29261
 
< 0.1%
28881
 
< 0.1%
28221
 
< 0.1%
27881
 
< 0.1%

Friend_count
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct2562
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.3507873
Minimum0
Maximum4923
Zeros1962
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:46.686528image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q131
median82
Q3206
95-th percentile720
Maximum4923
Range4923
Interquartile range (IQR)175

Descriptive statistics

Standard deviation387.304229
Coefficient of variation (CV)1.972511719
Kurtosis50.09427289
Mean196.3507873
Median Absolute Deviation (MAD)64
Skewness6.059008484
Sum19439317
Variance150004.5658
MonotonicityNot monotonic
2021-06-06T17:49:46.942074image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01962
 
2.0%
11816
 
1.8%
21117
 
1.1%
3860
 
0.9%
5789
 
0.8%
4749
 
0.8%
10737
 
0.7%
24732
 
0.7%
6720
 
0.7%
29719
 
0.7%
Other values (2552)88802
89.7%
ValueCountFrequency (%)
01962
2.0%
11816
1.8%
21117
1.1%
3860
0.9%
4749
 
0.8%
5789
0.8%
6720
 
0.7%
7671
 
0.7%
8718
 
0.7%
9700
 
0.7%
ValueCountFrequency (%)
49231
< 0.1%
49171
< 0.1%
48631
< 0.1%
48451
< 0.1%
48441
< 0.1%
48261
< 0.1%
48171
< 0.1%
48031
< 0.1%
47971
< 0.1%
47941
< 0.1%

Friendships_initiated
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct1519
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.4524711
Minimum0
Maximum4144
Zeros2997
Zeros (%)3.0%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:47.271715image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q117
median46
Q3117
95-th percentile418
Maximum4144
Range4144
Interquartile range (IQR)100

Descriptive statistics

Standard deviation188.786951
Coefficient of variation (CV)1.756934475
Kurtosis42.53560096
Mean107.4524711
Median Absolute Deviation (MAD)36
Skewness5.150757415
Sum10638117
Variance35640.51287
MonotonicityNot monotonic
2021-06-06T17:49:47.535830image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
02997
 
3.0%
12212
 
2.2%
21551
 
1.6%
31355
 
1.4%
41352
 
1.4%
61328
 
1.3%
51328
 
1.3%
111319
 
1.3%
81314
 
1.3%
131279
 
1.3%
Other values (1509)82968
83.8%
ValueCountFrequency (%)
02997
3.0%
12212
2.2%
21551
1.6%
31355
1.4%
41352
1.4%
51328
1.3%
61328
1.3%
71237
1.2%
81314
1.3%
91245
1.3%
ValueCountFrequency (%)
41441
< 0.1%
36541
< 0.1%
35941
< 0.1%
35381
< 0.1%
34151
< 0.1%
32381
< 0.1%
32331
< 0.1%
30861
< 0.1%
30781
< 0.1%
30241
< 0.1%

Likes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct2924
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean156.0787855
Minimum0
Maximum25111
Zeros22308
Zeros (%)22.5%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:47.826505image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median11
Q381
95-th percentile726
Maximum25111
Range25111
Interquartile range (IQR)80

Descriptive statistics

Standard deviation572.2806808
Coefficient of variation (CV)3.666614134
Kurtosis200.4456878
Mean156.0787855
Median Absolute Deviation (MAD)11
Skewness11.02370356
Sum15452268
Variance327505.1777
MonotonicityNot monotonic
2021-06-06T17:49:48.104336image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
022308
22.5%
16928
 
7.0%
24434
 
4.5%
33240
 
3.3%
42507
 
2.5%
52027
 
2.0%
61806
 
1.8%
71618
 
1.6%
81430
 
1.4%
91381
 
1.4%
Other values (2914)51324
51.8%
ValueCountFrequency (%)
022308
22.5%
16928
 
7.0%
24434
 
4.5%
33240
 
3.3%
42507
 
2.5%
52027
 
2.0%
61806
 
1.8%
71618
 
1.6%
81430
 
1.4%
91381
 
1.4%
ValueCountFrequency (%)
251111
< 0.1%
216521
< 0.1%
167321
< 0.1%
165831
< 0.1%
147991
< 0.1%
143551
< 0.1%
140501
< 0.1%
140391
< 0.1%
136921
< 0.1%
136221
< 0.1%

Likes_received
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct2681
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142.6893629
Minimum0
Maximum261197
Zeros24428
Zeros (%)24.7%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:48.398152image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median8
Q359
95-th percentile561
Maximum261197
Range261197
Interquartile range (IQR)58

Descriptive statistics

Standard deviation1387.919613
Coefficient of variation (CV)9.726861091
Kurtosis17384.94
Mean142.6893629
Median Absolute Deviation (MAD)8
Skewness112.0745682
Sum14126675
Variance1926320.851
MonotonicityNot monotonic
2021-06-06T17:49:49.034375image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
024428
24.7%
17305
 
7.4%
24541
 
4.6%
33347
 
3.4%
42669
 
2.7%
52373
 
2.4%
61873
 
1.9%
71680
 
1.7%
81538
 
1.6%
91351
 
1.4%
Other values (2671)47898
48.4%
ValueCountFrequency (%)
024428
24.7%
17305
 
7.4%
24541
 
4.6%
33347
 
3.4%
42669
 
2.7%
52373
 
2.4%
61873
 
1.9%
71680
 
1.7%
81538
 
1.6%
91351
 
1.4%
ValueCountFrequency (%)
2611971
< 0.1%
1781661
< 0.1%
1520141
< 0.1%
1060251
< 0.1%
826231
< 0.1%
535341
< 0.1%
529641
< 0.1%
456331
< 0.1%
424491
< 0.1%
395361
< 0.1%

Mobile_likes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct2396
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean106.1162995
Minimum0
Maximum25111
Zeros35056
Zeros (%)35.4%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:49.290778image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q346
95-th percentile481.9
Maximum25111
Range25111
Interquartile range (IQR)46

Descriptive statistics

Standard deviation445.2529851
Coefficient of variation (CV)4.195896268
Kurtosis360.9885806
Mean106.1162995
Median Absolute Deviation (MAD)4
Skewness14.16123656
Sum10505832
Variance198250.2207
MonotonicityNot monotonic
2021-06-06T17:49:49.589595image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
035056
35.4%
16297
 
6.4%
23941
 
4.0%
32917
 
2.9%
42265
 
2.3%
51794
 
1.8%
61598
 
1.6%
71395
 
1.4%
81212
 
1.2%
91149
 
1.2%
Other values (2386)41379
41.8%
ValueCountFrequency (%)
035056
35.4%
16297
 
6.4%
23941
 
4.0%
32917
 
2.9%
42265
 
2.3%
51794
 
1.8%
61598
 
1.6%
71395
 
1.4%
81212
 
1.2%
91149
 
1.2%
ValueCountFrequency (%)
251111
< 0.1%
216521
< 0.1%
167321
< 0.1%
140391
< 0.1%
135291
< 0.1%
129341
< 0.1%
126391
< 0.1%
121041
< 0.1%
120831
< 0.1%
119591
< 0.1%

Mobile_likes_received
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct2004
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.1204913
Minimum0
Maximum138561
Zeros30003
Zeros (%)30.3%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:49.872244image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q333
95-th percentile317
Maximum138561
Range138561
Interquartile range (IQR)33

Descriptive statistics

Standard deviation839.8894437
Coefficient of variation (CV)9.984362083
Kurtosis15522.64932
Mean84.1204913
Median Absolute Deviation (MAD)4
Skewness107.5312999
Sum8328181
Variance705414.2777
MonotonicityNot monotonic
2021-06-06T17:49:50.263355image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
030003
30.3%
18243
 
8.3%
24948
 
5.0%
33608
 
3.6%
42944
 
3.0%
52383
 
2.4%
62022
 
2.0%
71745
 
1.8%
81521
 
1.5%
91437
 
1.5%
Other values (1994)40149
40.6%
ValueCountFrequency (%)
030003
30.3%
18243
 
8.3%
24948
 
5.0%
33608
 
3.6%
42944
 
3.0%
52383
 
2.4%
62022
 
2.0%
71745
 
1.8%
81521
 
1.5%
91437
 
1.5%
ValueCountFrequency (%)
1385611
< 0.1%
1312441
< 0.1%
899111
< 0.1%
733331
< 0.1%
434101
< 0.1%
307541
< 0.1%
303871
< 0.1%
273531
< 0.1%
207701
< 0.1%
189251
< 0.1%

Www_likes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct1726
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.96242538
Minimum0
Maximum14865
Zeros60999
Zeros (%)61.6%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:50.700923image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q37
95-th percentile208
Maximum14865
Range14865
Interquartile range (IQR)7

Descriptive statistics

Standard deviation285.5601519
Coefficient of variation (CV)5.715498191
Kurtosis449.1484832
Mean49.96242538
Median Absolute Deviation (MAD)0
Skewness16.91102529
Sum4946430
Variance81544.60033
MonotonicityNot monotonic
2021-06-06T17:49:51.012293image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
060999
61.6%
14697
 
4.7%
22760
 
2.8%
31948
 
2.0%
41419
 
1.4%
51202
 
1.2%
61081
 
1.1%
7897
 
0.9%
8792
 
0.8%
9757
 
0.8%
Other values (1716)22451
 
22.7%
ValueCountFrequency (%)
060999
61.6%
14697
 
4.7%
22760
 
2.8%
31948
 
2.0%
41419
 
1.4%
51202
 
1.2%
61081
 
1.1%
7897
 
0.9%
8792
 
0.8%
9757
 
0.8%
ValueCountFrequency (%)
148651
< 0.1%
129031
< 0.1%
110771
< 0.1%
107631
< 0.1%
106271
< 0.1%
105391
< 0.1%
102551
< 0.1%
102321
< 0.1%
99021
< 0.1%
94311
< 0.1%

Www_likes_received
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct1636
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58.56883125
Minimum0
Maximum129953
Zeros36864
Zeros (%)37.2%
Negative0
Negative (%)0.0%
Memory size773.6 KiB
2021-06-06T17:49:51.347556image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q320
95-th percentile227
Maximum129953
Range129953
Interquartile range (IQR)20

Descriptive statistics

Standard deviation601.416348
Coefficient of variation (CV)10.26853934
Kurtosis23812.2491
Mean58.56883125
Median Absolute Deviation (MAD)2
Skewness126.257317
Sum5798490
Variance361701.6237
MonotonicityNot monotonic
2021-06-06T17:49:51.647833image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
036864
37.2%
18513
 
8.6%
25111
 
5.2%
33586
 
3.6%
42828
 
2.9%
52317
 
2.3%
61918
 
1.9%
71602
 
1.6%
81445
 
1.5%
91373
 
1.4%
Other values (1626)33446
33.8%
ValueCountFrequency (%)
036864
37.2%
18513
 
8.6%
25111
 
5.2%
33586
 
3.6%
42828
 
2.9%
52317
 
2.3%
61918
 
1.9%
71602
 
1.6%
81445
 
1.5%
91373
 
1.4%
ValueCountFrequency (%)
1299531
< 0.1%
621031
< 0.1%
396051
< 0.1%
392131
< 0.1%
340391
< 0.1%
326921
< 0.1%
293371
< 0.1%
231471
< 0.1%
226441
< 0.1%
150961
< 0.1%

Interactions

2021-06-06T17:48:16.412860image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:17.107451image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:17.924678image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:18.972082image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:19.998792image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:20.910866image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:21.724695image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:22.592647image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:23.691654image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:24.346681image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:25.250241image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:25.805726image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:26.520272image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:27.095397image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:27.672118image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:28.111460image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:28.813875image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:29.235737image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:29.516984image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:29.852053image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:30.291371image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:30.705395image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:31.112146image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:31.531888image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:31.852403image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:32.258639image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:32.784469image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:33.208208image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:33.656931image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:34.516401image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:35.237958image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:35.889562image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:36.676071image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:37.313678image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:39.238497image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:40.262865image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:41.014401image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:41.772283image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:42.782659image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:43.484228image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:43.916957image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:44.327704image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:44.680490image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:44.987300image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:45.334086image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:45.723473image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:46.111883image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:46.455621image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:46.791034image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:47.212894image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:47.556636image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:47.931623image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:48.244112image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:48.572228image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:48.931594image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:49.259708image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:49.617998image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:49.963788image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:50.289584image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:50.639890image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:51.111138image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:51.548624image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:51.954865image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:52.267355image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:52.564218image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:52.986081image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:53.486065image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:53.829805image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:54.157921image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:54.486037image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:54.836704image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:55.340395image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:55.914038image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:56.380796image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:56.740158image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:57.126883image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:57.455000image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:57.861234image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:58.361218image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:58.686849image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:58.999343image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:59.356443image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:48:59.783304image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:00.127043image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:00.505871image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:00.903137image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:01.218943image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:01.541743image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:01.871687image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:02.293546image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:02.731034image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:03.137269image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:03.449759image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:03.746626image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:04.059118image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:04.449729image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:04.816246image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:05.136052image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:05.501824image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:05.911598image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:06.255337image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:06.651323image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:06.973125image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:07.318909image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:07.753602image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:08.049112image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:08.354923image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:08.728452image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:08.997289image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:09.271120image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:09.565936image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:09.967875image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:10.362920image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:10.733268image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:11.074581image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:11.397684image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:11.807434image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:12.091259image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:12.440042image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:13.097558image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:13.629457image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:13.914081image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:14.195323image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:14.517225image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:14.841739image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:15.193001image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:15.480042image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:15.844089image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:16.187632image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:16.530402image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:16.851798image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:17.174596image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:17.514387image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:17.868171image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:18.154994image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:18.431821image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:18.781678image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:19.120754image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:19.474306image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:19.771175image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:20.087133image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:20.434876image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:20.812765image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:21.134704image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:21.428123image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:21.774540image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:22.154584image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:22.483843image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:22.851414image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:23.146230image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:23.504012image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:23.787836image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:24.152610image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:24.477412image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:24.814773image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:25.158145image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:25.457627image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:25.776772image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:26.204506image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:26.558155image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:26.936129image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:27.289041image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:27.653696image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:28.128032image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:28.546337image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:28.866139image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:29.291877image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:29.683619image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:30.136557image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:30.739674image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:31.114860image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:31.450669image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:31.816559image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:32.339707image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:32.767120image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:33.513659image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:33.862854image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:34.311976image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:34.674780image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:35.081884image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:35.486687image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:35.831483image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:36.256766image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:36.574199image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:36.867020image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:37.162836image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:37.469650image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:37.770614image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:38.124394image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:38.435204image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:38.733784image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:39.012853image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:39.313107image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:39.592812image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:39.945144image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
2021-06-06T17:49:40.410111image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Correlations

2021-06-06T17:49:51.927516image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-06-06T17:49:52.532170image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-06-06T17:49:53.053413image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-06-06T17:49:53.626365image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2021-06-06T17:49:41.023391image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
A simple visualization of nullity by column.
2021-06-06T17:49:41.848110image/svg+xmlMatplotlib v3.3.4, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

UseridAgeDob_dayDob_yearDob_monthGenderTenureFriend_countFriendships_initiatedLikesLikes_receivedMobile_likesMobile_likes_receivedWww_likesWww_likes_received
020943821419199911male266.000000000
11192601142199911female6.000000000
220838841416199911male13.000000000
312031681425199912female93.000000000
41733186144199912male82.000000000
51524765141199912male15.000000000
61136133131420001male12.000000000
7168036113420001female0.000000000
8136517413120001male81.000000000
9171256713220002male171.000000000

Last rows

UseridAgeDob_dayDob_yearDob_monthGenderTenureFriend_countFriendships_initiatedLikesLikes_receivedMobile_likesMobile_likes_receivedWww_likesWww_likes_received
989931654565191519948male394.04538414445011508844355961669127
98994206300620419931female402.01988332735110602572487333310332692
989951132164209199310female699.03611973450777684414690993859
989961668695242519894female182.0293812726018177655843117081756057
9899714589852814198512female290.022181618462610268429042503366018
98998126829968419454female541.021183413996180893505118874916202
989991256153181219953female21.01968172044011341243991059222820
990001195943151019985female111.0200215241195912554119591146201092
990011468023231119904female416.0256018545066516450657600756
990021397896391519745female397.020497689410124439410953002913